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METHOD FOR IDENTIFYING OR ISOLATING A MOLECULE 
AND MOLECULES IDENTIFIED THEREBY 

RELATED APPLICATIONS 

This application is a continuation-in-part of Serial No. 
08/580,980, filed on January 3, 1996, which is a continuation- 
in-part of application Serial No. 08/479 , 328 , filed on June 7, 
1995 , both of which are pending, and are incorporated by 
reference. 

FIELD OF THE INVENTION 

This invention relates to methodologies for identifying 
molecules of interest. In particularly preferred embodiments, 
the invention relates to the identification of molecules 
associated with pathological conditions such as cancer, 
(melanoma or renal cancer, e.g.), Hodgkin's Disease, 
autoimmune diseases and so forth. Also a part of the 
invention are the isolated molecules found as a result of the 
inventive method, such as presented peptides. These molecules 
include, inter alia, protein-containing molecules, isolated 
nucleic acid molecules encoding these, and antibodies which 
specifically bind to the protein-containing molecules. For 
convenience, the method described herein will be referred to 
as "serological fishing". 
BACKGROUND AND PRIOR ART 

It is fairly well established that many pathological 
conditions, such as infections, cancer, autoimmune disorders, 
etc., are characterized by the inappropriate expression of 
certain molecules. These molecules thus serve as "markers" 
for a particular pathological or abnormal condition. Apart 
from their use as diagnostic "targets", i.e., materials to be 
identified to diagnose these abnormal conditions, the 
molecules serve as reagents which can be used to generate 
diagnostic and/or therapeutic agents. A by no means limiting 
example of this is the use of cancer markers to produce 
antibodies specific to a particular marker. Yet another non- 
limiting example is the use of a peptide which complexes with 
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an MHC mol cule, to generate cytolytic T cells against 
abnormal cells* 

Preparation of such materials, of course, presupposes a 
source of the reagents used to generate these. Purification 
5 from cells is one laborious , far from certain method of doing 

so. Another preferred method is the isolation of nucleic acid 
molecules which encode a particular marker, followed by the 
use of the isolated encoding molecule to express the desired 
molecule. 

10 To date, two strategies have been employed for the 

detection of such antigens in, e.g., human tumors. These will 
be referred to as the genetic approach and the biochemical 
approach. The genetic approach is exemplified by, e.g., 
DePlaen et al., Proc. Natl. Acad. Sci. USA 85: 2275 (1988), 

15 incorporated by reference. In this approach, several hundred 

pools of plasmids of a cDNA library obtained from a tumor are 
transfected into recipient cells, such as COS cells, or into 
antigen-negative variants of tumor cell lines which are tested 
for the expression of the specific antigen. The biochemical 

20 approach, exemplified by, e.g., Falk et al., Nature 351: 290 

(1991), and Kawakami et al. , Nature 369: 69 (1994) both of 
which are incorporated by reference, is based on acidic 
elution of peptides which are bound to MHC-I molecules of 
tumor cells, followed by reversed-phase high performance 

25 liquid chromotogaphy (R-HPLC). Antigenic peptides are 

identified after they bind to empty MHC-I molecules of mutant 
cell lines which are defective in antigen processing, and 
induction of specific reactions in cytolytic T-lymphocytes. 
These reactions include CTL proliferation, TNF release, and 

30 lysis of target cells, measurable in an MTT assay, or a ftx Cr 

release assay. 

These two approaches to the molecular definition of 
antigens have the following disadvantages: first, they are 
enormously cumbersome, time-consuming and expensive; second, 
35 they depend on the establishment of cytolytic T cell lines 

(CTLs) with predefined specificity; third, their relevance in 
vivo for the course of the pathology or disease in question 
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has not been proven, as th respective CTLs can be obtained 
not only from patients with the respective disease, but also 
from healthy individuals, depending on their T cell 
repertoire. 

The problems inherent to the two known approaches for the 
identification and molecular definition of antigens is best 
demonstrated by the fact that both methods have, so far, 
succeeded in defining only very few new antigens in human 
tumors. See, e.g., van der Bruggen et al., Science 254: 1643- 
1647 (1991); Brichard et al., J. Exp. Med. 178: 489-495 

(1993) ; Coulie, et al., J. Exp. Med. 180: 35-42 (1994), 
Kawakami et al., Proc. Natl. Acad. Sci. USA 91: 3515-3519 

(1994) . 

It would be desirable to have available a method which 
can be used not only for detection of tumor-associated 
antigens, but to determine molecules associated with any 
abnormal or pathological condition. Such a method would also 
facilitate the identification of such molecules, thereby 
enabling their use on the generation of, e.g;, antibodies, 
cytolytic T cells, and so forth. 

It is therefore the purpose of the present invention to 
develop methods and reagents for the simple detection and 
molecular characterization of antigens in human tissues, 
especially in tumor cells, which are useful in the molecular 
diagnosis of diseases, and/or for immunotherapy and gene 
therapy of infectious, autoimmune and malignant diseases. The 
invention is delineated in the disclosure which follows. 

BRIEF DESCRIPTION OF TOE FIGURES 

Figure 1 shows the principles of the approach of the 
invention . 

Figure 2 shows a nitrocellulose membrane with a positive 
clone derived from the cDNA of a renal cell clear carcinoma 
that reacts with a 1:100 dilution of the patient's serum. 

Figure 3 shows, in bar graph form, the Northern Blot 
analysis of clone H0M-RCO313 in renal cell carcin ma, normal 
kidney and other human tissues. 
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Figure 4 shows the translated region of the gen coding 
for HOM-RCC-313. 

DETAILED DESCRIPTION OF PREFE RRED EMBODIMENTS 

The following disclosure describes a methodology referred 
to as serological fishing. In it, a cell sample is taken from 
a subject afflicted with a pathological condition. The cells 
preferably are exemplary of the pathology. For example, if 
the subject has melanoma, the cells are melanoma cells. If 
the subject is suffering from a neural disorder, e.g., then 
the cells are preferably a sample of the afflicted cells. 
This approach is warranted because the afflicted cells are 
most probably the best source of protein-containing molecules 
of interest, i.e., such molecules which are specifically 
associated with the pathological condition of interest. 

Note that cells representative of pathological conditions 
are not the only cells which may be used in the inventive 
method. It is very important, e.g., to ascertain those 
cellular "markers" associated with differentiation and 
maturation of cells, for example. The example of 
hematopoietic stem cells comes to mind. Similarly, the 
invention contemplates the isolation of, e.g., receptor 
molecules for specific ligands. In effect, one can assay for 
the presence of any molecule of interest using this 
methodology. 

The cells chosen are then used to prepare a library of 
complementary DNA (i.e., "cDNA"). This methodology is well 
known to the skilled artisan, and need not be reiterated here. 
It is, of course, based upon the established fact that if 
proteins are expressed by the cells, then messenger RNA (mRNA) 
must be present. These mRNA molecules are not long lived, and 
are unstable, so they are not practical to work with. The 
stability brought to the molecules when cDNA is used is very 
helpful to the method. 

Once the cDNA is made, it is used to construct a vector 
library. In short, carrier vectors are treated, such as by 
cutting and splicing, to receive molecules f cDNA. The 
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choice of v ctor may vary, as the skilled artisan is well 
familiar with many such examples. 

Especially preferred are virus based vectors. In the 
case of eukaryotic cells, retrovirus or adenovirus based 
vectors are preferred. Such vectors contain all ^br a part of 
a viral genome, such as long term repeats ( "LTRs" ) , promoters 
(e.g., CMV promoter, SV40 promoter, RSV promoter), enhancers, 
and so forth. When the host cell is a prokaryote, then 
bacterial viruses, i.e., phages, are preferred. Exemplary of 
such vectors are vectors based upon, e.g., lambda phage, in 
any case, the vector may comprise elements of more than one 
virus . 

The resulting vectors are transf ected or transformed into 
a host cell, which may be eukaryotic or prokaryotic. 

Any cell normally used for transfection or transformation 
may be used in the protocol. Preferred materials include 
strains of £. qqIL, CHO cells such as CHO-l, COS cells such as 
cos-7, and so forth. Similarly, yeast cells, e.g., strains of 
Saccharomyces , strains of Pseudomonas, such as Pseudompn^ g 
aeruginosa. Bacillus bacteria, well known insect host cell 
SpodQpteira f rug i per da . and so forth, may all be used. 

Once the recipient cells receive the vectors, they are 
cultivated so as to express foreign, protein containing 
molecules. "Protein-containing" is used herein because, while 
prokaryotes express only proteins, eukaryotic cells are well 
known for their ability to post-translationally modify 
proteins, so as to produce glycoproteins, lipoproteins, e.g. 
It must also be borne in mind that "protein containing" as 
used herein, also encompasses peptides, such as the peptides 
presented by MHC molecules. 

The processes now described below take place 
independently of the process described above, and no 
chronological relationship between the two facets of the 
invention is intended. 

In pathological conditions such as cancer and, e.g., 
autoimmune diseases, there is some immune reaction to 
molecules associated with th pathology. This reaction can 
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include an antibody response, B cell proliferation, 
proliferation of specific T cell subpopulations , increases in 
cytokine production, and so forth. The molecules and cells 
associated with the response may be found in body fluids of a 
subject, such as his or her serum. The immune responders will 
react with the molecule of interest whether it is produced 
recombinantly or autologously. The problem is to find them. 
As the examples show, this is done in a unique way. First, 
the body fluid, or other sample of interest, is reacted with 
a sample of the same host cells used for transfection or 
transformation. In this first step, the host cells are not 
transfected or transformed. The effect of this is to strip 
any immunogenic binding partners specific for the host cell 
rather than the targeted molecule. This step is necessary 
because, as was pointed out, susra, the host cell may be one 
against which the subject has developed an immune response at 
some point. This first stripping removes these immune 
components . 

A second stripping step is then carried out. In this 
step, the previously stripped sample is now reacted with a 
sample of the same host cell as was described supra P this time 
having been transfected or transformed with the carrier vector 
lacking cDNA from the subject. The reason for this second 
stripping step is an observation made by one of the present 
inventors and not reported in the literature previously. The 
materials used as vectors, such as phages, viruses, etc., are 
useful because they naturally infect cells. Thus, £. coli, 
which inhabit the lower intestine of humans, are infected with 
lambda phage. It had not been considered, previously, that 
the immune response to £. sali includes a response to these 
infectious agents. Thus, applicants have surprisingly, 
achieved an ability to remove interfering immune components to 
an unprecedented degree by carrying out the two stripping 
steps. As noted, the first is against untransfected or 
untransformed host cells. The second is against host cells 
transfected or transformed with a vector which does not carry 
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cDNA, wherein the vector is immunologically equivalent to the 
vector us d to carry cDNA, as described supra » 

It is especially preferred to carry out each of these 
stripping steps using a plurality of similar, but different 
procedures. The experiments which follow, for example, show 
absorption on a solid phase column, and then absorption on 
nitrocellulose paper. Applicants do not wish to be bound by 
any theory as to why the use of two similar but different 
protocols produces the results described herein. It is to be 
borne in mind, hereafter, that whenever "contacting a sample" 
is used herein, it is not to be limited to one contact step 
only, but may refer to more than one, preferably different, 
contact protocols designed to remove interfering binding 
partners from a sample under scrutiny. 

It should be understood that these stripping steps may be 
done completely independently of the steps used to prepare the 
cDNA library. For example, if the test for an antigen is to 
be done at day w O" , the stripping of sample may be done the 
day before, a week before, and so forth. One can also "bank" 
stripped sample from a donor or subject for future use. 

The sample used is preferably serum, but need not be. 
Any sample which contains immunogenic binding partners may be 
so used. 

In the next step of the method, lysed, transfected cells 
carrying the cDNA and expressing heterologous protein are 
contacted with the twice stripped sample* This sample should 
only contain immune components specific for the heterologous 
protein, and should bind thereto. This binding is facilitated 
if the cell lysates have been immobilized via contact to, 
e.g., activated filter paper, a solid phase column, etc., but 
this solid phase binding is not necessary, as the art will 
surely recognize that many, varied forms of assays are 
available for identifying a molecule of interest. 

Once the immune component binds to the target molecule, 
a further step is desirably, but not necessarily, carried out. 
This additional step involves the use of some binding partner 
for the first immune component, such as anti-IgG, carrying an 
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identifiable label. The label may be a dye, an enzyme, a gold 
particle, a radiolabel, or any of the standard lab Is used in 
immunoassays. 

Once identification is carried out, the immune components 
are removed, leaving the target molecule. The target molecule 
is then studied, using any of the standard methodologies in 
the art. 

The artisan will note that the methodology also results 
in isolation of immune components which bind to the molecule 
of interest. Thus, in another aspect of the invention one can 
isolate antibodies, e.g., which are specific binding partners 
for the molecule of interest. 

Yet another immune component which may be identified and 
isolated following the invention is a cytolytic T cell ("CTL" 
hereafter) , specific for complexes of peptides derived from 
the identified molecule and MHC molecules to which these 
peptides bind, forming a complex. It is fairly well accepted 
that a CTL response involves the identification of complexes 
of MHC molecules and peptides, generally about 8-12 amino 
acids in length, but most preferably 9 or 10 amino acids in 
length, by T cell receptors ("TCRs") on the surface of 
circulating T cells* The TCRs react by binding to these 
complexes, "setting in motion," as it were, a series of 
reactions including the proliferation of CTLs specific for 
these complexes. One can produce and/or isolate such CTLs 
using the method of the invention, plus further steps. 

As is pointed out in the examples which follow as well as 
the disclosure in general, one can easily identify cDNA 
encoding an antigen of interest. Once the cDNA is identified, 
one uses it to transfect host cells which either already 
present desired MHC molecules on their surface, or which have 
been trans feet ed with DNA encoding these MHC molecules. The 
cDNA for the molecule of interest is expressed, and the 
molecule is processed to antigenic peptides which are 
presented by MHC molecules, such as HLA molecules. CTLs 
directed against the complexes are obtained from lymphocytes, 
such as autologous lymphocytes. From r sponder cell 
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populations, long-term CTL clones are then obtained by the 
well known technique of limiting dilution. Once a positive 
CTL response is observed, the specific peptides presented to 
the CTLs are identified using established methods for example, 
5 screening the specific of previously identified CTL clones. 

Alternatively, the more recently described method of studying 
the sequence of the molecule of interest to identify potential 
MHC-binding motifs then analyzing these peptides, first for 
binding to the relevant MHC molecule and then, if positive for 

10 MHC-binding, for their ability to generate CTLs recognizing 

the peptide MHC complex may be used. Of course the peptides 
can also be eluted from the cells and sequenced, using well 
known techniques. 

It will also be noted by the skilled artisan that one can 

15 correlate the expression of the molecule of interest back to 

a particular host cell or cells which expressed it. In so 
doing, one can remove the cDNA which expressed the molecule of 
interest, sequence it, and so forth. This aspect of the 
method is another feature of the invention. 

20 Specific embodiments of the invention will be seen in the 

examples which follow. Figure 1 depicts the method generally. 

Example 1 

For the establishment of a cDNA library from human tissue 
total RNA was obtained from 0.5 ug of a renal clear cell 

25 carcinoma and established according to the method of 

Chomzynski, J. Analyt. Biochem. 162: 156*159 (1987), 
incorporated by reference. The mRNA was extracted from total 
RNA with oligo-dT-cellulose. The synthesis of the first 
strand cDNA was accomplished by the method described by Gubler 

30 and Hoffmann, Gene 25: 263 (1983) using RNase H and DNA 

polymerase I . For adaptation of the cDNA Klenow enzyme , 
adaptors with EcoRI restriction enzyme sites were ligated to 
the cDNA ends using T4 DNA ligase (Ferretti L and Sgamerella 
V, Nucl. Acids Res. 9: 3695 (1981)). Following restriction 

35 enzymatic digestion with the enzyme Xhol r cDNA molecules of 

different length were separated using Sephacryl 400 and 
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transfected into AZAPII phage vectors (Short et al., Nucleic 
Acids Res. 16: 7583 (1988)). The recombinant phage DNA was 
packed into phages after ligation with packing extracts and 
used for the transfection of £. coli bacteria. The titration 
5 of the library resulted in 1.8 x 10* recombinant primary 

clones. The total cDNA library was transfected in £. coli and 
amplified. The titer of the cDNA library after amplification 
was 10" plaque forming units per ml (pfu/ml). These 
transfected cells were used in experiments which follow. 

10 BXQBPle 3 

In accordance with the invention as described, supra r 
identification of immunogenic material was achieved by using 
human sera which have been completely depleted of antibodies 
directed against antigens derived from native and lytic A 

15 phage-transfected £. call bacteria. To this end, the serum 

was "stripped" via absorption, as now described. 

£. coli bacteria of the strain XLl-Blue were cultured in 
50 ml LB medium overnight. After achieving an optical density 
of ODftoo = 1.0, the bacteria were pelleted by centrif ugation, 

20 resuspended in 5 ml phosphate buffered saline (PBS), and 

sonicated by ultrasound to form a lysate. The bacterial 
lysate was bound onto a matrix of activated Sepharose, which 
was then put into a column and used for the absorption of the 
human serum. The serum was run over this column 10 times. 

25 A culture of £. cali XLl-Blue bacteria in the exponential 

growth phase was pelleted by centrif ugation, transfected in 
0.01 M magnesium sulfate with 10 6 AZAPII phages without a 
recombinant insert and incubated in 5 ml LB medium for four 
hours. The lysate of the transfected bacteria was used in the 

30 same manner as the untransf ected bacteria, with the human 

serum described supra being passed through the column an 
additional ten times. 

To complete the depletion cf the serum, interfering 
antibodies from lytically transfected £• coll bacteria were 

35 cultured on agar plates (10 hours, 37*C) and their proteins 

were blotted onto nitrocellulose membranes after this 
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culturing step. Following this, the serum which had been 
preabsorbed according to the above steps was transferred to 
the blotted nitrocellulose membrane, and the absorption 
procedure was repeated five times. The serum, which was 
processed in accordance with the invention, tfas totally 
depleted of antibodies directed against antigens derived from 
E. £Qli and phages. 
Example 3 

In these experiments, a renal cancer-specific antigen was 
identified via the following steps. Bacteria of the strain £. 
coli XLl-Blue were transf ected with recombinant phages derived 
from the described cDNA library and plated at a density of 4- 
5x10* plaque forming units (PFUs) per plate in LB-medium with 
isopropylthiogalactopyranoside ("IPTG"). After 12 hours of 
incubation at 31* C, nitrocellulose membranes were put on top 
of the cultures, and the culture plates were incubated for 
another four hours. This was followed by incubation of the 
nitrocellulose membrane for one hour in Tris-buf f ered saline 
(TBS) with 5% milk powder. After washing the nitrocellulose 
membranes three times in TBS, the stripped human serum secured 
following Example 2 was diluted 1:1000 in TBS/0.5% milk powder 
(w/v) and incubated overnight with gentle shaking. After the 
incubation with the nitrocellulose membrane the serum was 
removed and kept for additional testing. Following incubation 
with serum, the nitrocellulose membranes were washed three 
times in TBS, and incubated with polyclonal alkaline 
phosphatase-conjugated goat anti-human IgG for one hour. 
Following this, the nitrocellulose membranes were washed 
repeatedly with TBS/0,01% Tween 20 (v/v)). The reaction was 
developed using nitroblue tetrazolium chloride and 
bromochloro-indoyl-phosphate in TBS. The binding of human 
antibodies to the expressed protein became visible by a blue, 
ring-formed color deposit on the nitrocellulose membrane. The 
efficient preabsorption of the serum made it possible to 
develop the membrane at 37 *C over several hours without 
compromising the quality of the test because of background 
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reactivity caused by antibodies against E. coli and phage 
antigens » 

Positive clones were localized on the agar plates, 
transferred into transfection buffer, and used for a second 
round of transfection and subcloning. A total- of 1.8x10* 
recombinant clones were subjected to screening and five 
different positive-reacting clones were identified. 
Example 4 

Positive clones secured following Example 3, i.e., those 
which had bound antibodies derived from the processed human 
serum, were subcloned to monoclonality by repeated rounds of 
transfection and testing of reactivity with the processed 
human serum. P-bluescript phagemids with the respective cDNA 
inserts were cloned by in vivo excision (Hay and Short, 
Strategies 5: 16-19, 1992) from the AZAPII phage vectors and 
used for transfection of E* coli SOLR bacteria* Plasmids were 
isolated from the bacteria after alkaline lysis with NaOH in 
a modification of the method of Birnboim and Doly, J. Nucl. 
Acids Res. 7: 1513 (1979). The recombinant plasmid DNA was 
seguenced according to the classic method of Sanger (Proc. 
Natl. Acad. Sci. USA 74: 5463 (1977)) using M13-forward and 
M13-reverse oligonucleotides. The DNA seguence obtained and 
the resulting amino acid sequence were checked for in nucleic 
acid and protein data banks (Gene Bank, EMBL, Swiss Prot). 
The sequencing of the cDNA inserts was continued using 
internal oligonucleotides. Analysis showed no homology with 
any sequences deposited in the data banks. The full length 
cDNA clone referred to as SK313, which had been cloned with 
the RACE method (Frohman MA, Dush MK, Martin GR, Proc. Natl. 
Acad Sci. USA 85: 8998 (1988)), had a carbonic anhydrase 
domain at the 5' end. The nucleic acid sequence of this 
molecule is presented in SEQ ID NO: 1. Figure 2 shows a 
nitrocellulose membrane with a positive clone from these 
experiments • 
sample 5 

As a follow up to these experiments, RNA was isolated 
from a spectrum of malignant and normal human tissues 
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according to the method of Chomzynski and Sacchi, Analyt 
Biochem. 162: 156 (1987). After denaturation, the total 
isolated RNA was separated on an agarose gel containing 1% 
formaldehyde by electrophoresis (Goldberg, Proc. Natl. Acad. 
5 sci. USA 77: 5794 (1980)) and then blotted onto a nylon 

membrane according to a known method (Seed, Nucl. Acids Res. 
10: 1799 (1982)) Radiolabeled cDNA inserts of the identified 
clones were used for hybridization. The hybridization was 
carried out according to a known method (Geoffrey and Berger, 

10 Enzymol. 152: 419 (1987)). The presence of the respective RNA 

was demonstrated using autoradiography and X-ray films. The 
analysis demonstrated that the mRNA of clone HOM-RCC-313 was 
overexpressed in 4 out of 19 renal cell carcinomas compared to 
normal kidneys. Ve ry SEfiafc expression was found only in 

15 colonic mucosal tissue and in normal kidney. Expression in 

other tissues could not be demonstrated. 
Example 6 

To determine the incidence of antibodies against antigens 
which are identified in accordance with the invention , sera 
20 from healthy individuals and tumor patients were analyzed. To 

this end, the sera were processed as described, supra . and 
depleted of antibodies against antigens derived from £. coli, 
and phages. For the detection of antigen-specific antibodies, 
phages derived from reactive clones were mixed with non- 
25 reactive phages derived from the same cDNA library at a ratio 
of 1:10 and tested as described supra . for reactivity with 
antibodies in the human test serum. The serum which had been 
used for the identification of the antigen was used as a 
positive clone. The non-reactive phages served as a negative 
30 control. A serum sample was positive for antigen reactive 
antibodies, if the expected percentage of the phage plagues 
showed a positive reaction. In the case of the renal cell 
carcinoma antigen represented by clone HOM-RCC-313, the 
analysis of a spectrum of human sera showed that only sera 
. 35 from renal cell carcinoma patients contained reactive 
antibodies. Sera from healthy controls and patients with 
other tumors did not contain such antibodies. 
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The cDNA for clone HOM-RCC-313 was xcised from the 
plasmid DNA by digestion with the restriction enzyme EcoRl, 
and separated by agarose gel electrophoresis, followed by 
extraction from the gel. This was then used to create a 
vector which expresses a fusion protein with the bacterial 
protein anthranilate synthetase. A relevant fragment in the 
exact open reading frame was cloned into pATH plasmid vectors 
(Koerner, et al, Meth. Enzymol. 194: 477 (1991). Induction of 
protein expression was obtained after transformation of the 
plasmids into £. qqU of strain BL21 as described (Spindler, 
et al, J. Virol. 49: 132 (1984)). Expressed, fusion proteins 
were separated by SDS gel electrophoresis, excised from the 
gel, eluted and freeze dried. Rabbits were immunized by 
subcutaneous injection with 100 fxg of the lyophilisate 
dissolved in Freund's adjuvant. Immunization was repeated 
three times at two-week intervals using incomplete Freund's 
adjuvant. The rabbit was bled and antiserum was obtained. 
The obtained antiserum was depleted from antibodies reactive 
with E- GQlx and phages in the manner described supra and 
tested for reactivity against the renal carcinoma antigen as 
described for the human serum. Reactivity was detected at 
dilutions of 1: > 100,000. 
Example 7 

The protocols set forth in the preceding examples were 
followed, using biopsied tissue taken from different subjects 
suffering from (i) malignant melanoma, (ii) astrocytoma, and 
(iii) Hodgkin's Disease. Table 1, which follows, summarizes 
the results , including those obtained with the renal cancer 
study, set out in detail in Examples 1-6, supra . 
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Table 1. Antibody reactivity of autologous sera with 
recombinant clones derived from human tumor cDNA. cDNA 
libraries were screened with autologous patient serum. 
Positive clones were subclone*} to monoclonality . Inserts from 
each clone were amplified with plasmid primers and separated 
by agarose gel electrophoresis. Southern blots were performed 
by cross hybridization with the respective inserts. 



clones positive different 

tumor tested clones inserts 

malignant melanoma 1.0 x 10* 40 10 

renal cell carcinoma 1.8 x 10* 7 5 

astrocytoma 1.2x10* 49 5 

Hodgkin's disease 1.0 x 10* 14 4 



Analysis of the different inserts showed that the 
melanoma cells expressed the known tumor rejection antigen 
precursor MAGE-1 (see van der Bruggen et al., Science 254: 
1643-7 (1991), incorporated by reference), as well as a new 
antigen. A portion of the cDNA sequence of this antigen is 
set forth in SEQ ID NO: 2. 

When the astrocytoma study was completed, the observed 
insert appeared to correspond to the previously described Tegt 
gene (Old, Cane. Res. 41: 361-375 (1981), incorporated by 
reference) . 

When the Hodgkin's Disease study was completed, a 
previously unknown antigen was isolated, and cDNA encoding it 
was identified in the library, using standard methods. The 
antigen is a newly observed, lectin-like structure, a portion 
of the cDNA for which is set forth in SEQ ID NO: 3. Also 
observed were antibodies against restin, described by Bilbe, 
et al, EMBO J 11: 2103-13 (1992). This is an intermediate 
filament associated protein, expression of which has been 
shown to be restricted to Hodgkin and Reed- Sternberg cells, as 
well as cultured monocytes. 
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A further study of occurrence of antibodies against the 
antigens described in Examples 1-7 was carried out. Table 2 
summarizes these assays. In these studies, phages from 
positive clones were mixed with non-reactive phage 
(ratio:l:10) f and then used to transfect bacteria (e. coli). 
Dilutions of patient sera (1:200) , were used, in an enzyme 
linked immunosorbent assay (ELISA) , as described supra. "HOM- 
MEL-40" refers to the new melanoma antigen (SEQ ID NO: 2), 
while "HOM-MEL-55" refers to HAGE-1 (van der Bruggen et al., 
supra). "HOM-RCC 3.1.3" is the renal cancer antigen of SEQ ID 
NO: l. "HOM-GLO-30.2.1" refers to the previously identified 
astrocytoma associated antigen, "HOM-HD-21" refers to the new, 
lectin-like antigen of SEQ ID NO: 3, and "HOM-HD-397" is the 
previously identified restin antigen. 
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The fact that antibodies against the tumor antigens, excepting 
only res tin, were detected, albeit at varying rates, only in 
the sera of patients diseased with the same type of tumor 
suggests that tumor growth is essential for the development of 
a humoral response against tumor antigens. 

The reason for the presence of restin in healthy controls 
is not clear. One may speculate that tolerance against 
respective antigens might be circumvented, because the antigen 
may have similar sequences to another antigen, the donor may 
have premalignant cells, or the antigen may be activated in 
normal cells under non-malignant conditions, such as viral 
infections, or other inflammatory processes. 
Example 9 

In order to determine the expression pattern of the newly 
identified antigens described herein, Northern blot analysis 
was carried out, using a variety of human tissues. 

RNA was extracted from tissue samples (tumor and normal) 
using the well known guanidium 
isothiocyanate /phenol /chloroform method of Chomzynski, et al. , 
supra . The RNA integrity was checked via electrophoresis in 
formalin/MOPS gels. Then, gels containing 40 ug of RNA per 
lane were blotted onto nylon membranes. These Northern blots 
were then probed with the cDNA of SEQ ID NO: 1, 2 or 3. 
Hybridization was with 33 P labelled probes at 42 °C, with 
formamide. The filters were washed at 65°C, at lxSSC, 0.2% 
SDS, and exposed for 16 hours. These are "stringent 
conditions" as defined hereafter. After exposure, filters 
were stripped and rehybridized with GAPDH. 

Table 3 summarizes these results. 
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As will be seen, the new melanoma associated antigen is 
strongly expressed in melanoma, but not other tissues. 
Carbonic- anhydrase- like antigen was strongly expressed in 
about 20% of renal cell carcinomas, and only weakly in normal 
renal tissue. Tegt was overexpressed on 8/12 astrocytoma 
tissues compared to normal brain tissue. The tnRNA for the 
lectin like molecule associated with Hodgkin's disease was 
increased about ten fold in diseased tonsils as compared to 
normal tonsils, suggesting that overexpression may be a 
frequent characteristic of proteins which elicit autologous B 
cell responses. 
Example 10 

Further studies were carried out on the HOM-MEL-40 
sequence. Using standard genetic analysis techniques, the 5' 
region of the mRNA for HOM-MEL 40 was shown to have a tyrosine 
kinase binding domain. This suggests that HOM-MEL-40 may 
function as a receptor. The 3' portion of the RNA is 
identical with an RNA molecule for "SSX, n a molecule known to 
be involved in the SYT-SSX translocation in synovial tumors. 
Example 11 

Additional experiments were also carried out to study 
HOM-MEL 40. Standard Northern blotting showed that, with the 
exception of testis, HOM-MEL 40 was not expressed in normal 
tissues. In contrast, it was expressed in 50% of melanomas, 
20% of prostate cancers, 20% of gastric cancers, 26V of 
colorectal cancers, 12% of the lung cancers and 20% of breast 
and hepatacellular cancer. It was also found in 1/10 gastric, 
and 1/5 thyroid carcinomas. 

Additional Western blotting work was carried out, showing 
that antibodies against HOM-MEL 40 were present in 10 of 89 
melanoma patients tested, but only 3 out of 49 healthy male 
sub j ects . 

In yet further studies, it was observed that HLA-A2 
positive tumor cells presented a nonamer derived from HOM-MEL. 
This suggests that HOM-MEL 40 specific vaccines, useful in 
inducing CTLs, are possible. 
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Example 12 

The phage assay described in the prior examples is not 
appropriate . for screening large numbers of serum samples. In 
order to do so, a modification of the standard Western Blot 
was developed. This variation is based upon* His -tagged, 
recombinant HOM-MEL 40, as is herein described. 

HOM-MEL-40 was amplified, over 20 cycles, using pfu 
polymerase, on plasmid cDNA prepared from melanoma tissue. 
The oligonucleotide primers used were: 

5' -GCCAAATACTTCTCTAAGGAAGAGTGG-3' (SEQ ID NO: 4); (sense) 

5 ' - TTCACTGTTGTGAACACTTGCTTTCAC- 3 ( SEQ ID NO : 5 ) ; 

(antisense) 

Polymerase chain reaction (PCR) was carried out at 95°C/l 
minute; 60°C/1 minute; and 72°C/1 minute, followed by a final 
extension at 72°C for 10 minutes. The amplification product 
was gel purified, using art recognized techniques, and then 
ligated in frame to Smal digested, dephosphorylated and gel 
purified pQE32 vector. This results in production of a fusion 
protein having a "tail" of 6 histidine molecules at the N- 
terminus. 

The construct was then transformed into E. coli SG13009 
(pREP4) strain, followed by selection on plates containing 
kanamycin and ampicillin. Individual colonies were picked, 
and expressed on a small scale by inducing th<ese with 2 mM 
isopropyl thiogalactoside {IPTG) . This permits checking for 
protein expression. Small scale purification over Ni-NTA 
columns was then performed, for each clone. 

One clone was identified as expressing a protein of 
expected length. This clone was isolated and sequenced using 
well known techniques. It was verified as HOM-MEL-40 . 
Following the identification, large scale induction of 
recombinant protein was carried out. Specifically, cells were 
induced with 2 mM IPTG, and harvested five hours later. Cells 
were lysed, by combining with a buffer of 8 M urea, 100 mM 
Na 2 P0 4 , 10 mM Tris-HCl <pH8) , 0.01% Triton X, overnight. Any 
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cellular debris was spun down, and supernatant was loaded onto 
pre-equilibrated.Ni-NTA resin. Washes were performed with two 
volumes of the aforementioned buffer, at pH8, and then at 
least 10 volumes of the buffer, at pH6.3. Protein was then 
5 eluted, using the buffer described herein, plus 250 mM 

imidazole. Yields ranged from 15 to 40 mg of His -tagged 
protein per liter of bacterial culture. 

Western Blotting was then carried out, using the His- 
labeled protein. In these assays, 5 ug of recombinant His- 

10 tagged proteins, which served as internal negative controls, 

were mixed with 2xSDS sample buffer (0.1 M Tris-HCl, pH 6.8, 
0.2M dithiothreitol, 4% SDS, 0.2% bromophenol blue, 20% 
glycerol), and were then electrophoresed in 12% SDS-PAGE, 
followed by blotting to nylon membranes using semi -dry 

15 transfer. 

After blocking of any unspecified binding with 5% low fat 
milk in PBS (1 hour) , membranes were incubated with 1:100 
diluted sera from tumor patients or healthy controls. Blots 
were then incubated for one hour with alkaline phosphatase 

20 conjugated mouse ant i -human IgG. The membrane was then 

incubated, consecutively, with rabbit anti-mouse IgG (30 
minutes), anti-alkaline phosphatase, and then 0.25 mg/ml of 
alkaline phosphatase. After each incubation step, the 
membranes were washed extensively with TBS and 0.1% Tween. 

25 Visualization was performed by staining with 5-bromo-4-chloro- 

3 -indolyl -phosphate (BCIP) , and nitroblue tetrazolium. The 
sera were analyzed in random order (healthy /melanoma 
positive) , with the observer blind to the status of the 
sample. All analyses were carried out in duplicate. 

30 The blotting showed a product with a molecular weight of 

about 24 kD by SDS-PAGE, which is consistent with a calculated 
molecular weight of 21.6 kD, based upon predicted amino acid 
sequence . 

The immunoblotting described supra, was carried out on 89 
35 melanoma samples, six ovarian cancer samples, and ten renal 

carcinoma samples. Of these, 11 melanoma samples, one ovarian 
cancer sample, and three renal cell carcinoma samples were 
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positive. Sera from subjects with colorectal, lung, breast, 
gastric, or pancreatic cancer were negative. . A total of 41 
healthy controls were also analyzed, of which three were 
positive. Any sera which were reactive in the Western blot, 
as well as twenty negative serum samples were then -reassessed, 
using the phage assay, described supra . The reactivity was 
confirmed for 10 of the 11 melanoma patients and the positive 
ovarian cancer patient. The renal cell cancer patient and the 
healthy controls were negative, and were all samples negative 
in the Western blot. 

There were 16 melanoma patients who provided both serum 
and tumor specimens. It was found that HOM-MEL-40 expression 
by the tumor could be compared with antibody reactivity in the 
sample. As will be seen in the table which follows, eight of 
the sixteen patients had HOM-MEL-40 positive tumors, but only 
three had antibodies against the antigen in their serum. No 
antibodies were detected in the serum of patients with HOM- 
MEL-40 negative tumors. 
Exampltj?_H 

Rammensee, et al., Immunogenetics 41: 178-228 (1995), 
incorporated by reference, disclose many peptides which bind 
to HLA-2.1, and some which also provoke 
CTL proliferation. Some of these are of formula: 

XaaLeu (Xaa ) 6 ( lie , Leu , Val ) 
(SEQ ID NO: S) . The deduced amino acid sequence for HOM-MEL- 
.40 was screened for sequences which might act as HLA-A2.1 
binders/CTL stimulators, and the following were found: 
Arg Leu Gin Gly He Ser Pro Lys He (SEQ ID NO: 7) ; 
Arg Leu Arg Glu Arg Lys Gin Leu Val (SEQ ID NO: 8) ; 
Lys He Gin Lys Ala Phe Asp Asp He (SEQ ID NO: 9) 
These peptides were synthesized, using well known techniques 
and Fmoc protected amino acids. The peptides were then 
purified, using Sephadex G25, followed by reverse phase HPLC 
on a C-18 column. T2 cells, which are deficient in 
transporters associated with antigen presentation (DeMars, et 
al., Proc. Natl. Acad. Sci. USA B2: 8183-8187 (1985); Slater, 
et al., Immunogenetics 21: 235-241 (1990)), were used in 
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peptide binding assays, as follows. A sample of 5xl0 5 T2 
cells were incubated for 4 hours at 37°C, in the presence or 
absence of 100 uNI of the HOM-MEL-40 peptides. Positive 
controls were an EBV LMP2 derived peptide: 

Cys Leu Gly Gly Leu Leu Thr Met Val (SEQ ID^O: 10) 
and an HIV reverse transcriptase derived peptide: 

lie Leu Lys Glu Pro Val Gly Val (SEQ ID NO: 11) 
Deregulation of HLA-A2.1 on T2 cells was measured by labelling 
with an anti-HLA-A2.1 specific monoclonal antibody (i.e., 
BB7.2), followed by incubation with an FITC conjugated goat 
anti -mouse antibody. The samples were analyzed by flow 
cytometry, with upregulation of HLA-A2.1 being given by the 
ratio: 

mean fluorescence intensity 
in sample with peptide 



mean fluorescence intensity 
in sample without peptide 

Each of the three peptides were seen to bind to HLA-A2.1, with 
SEQ ID NO: 8 showing strongest HLA-A2.1 upregulation by far; 
as determined by FACS analysis. 

As the foregoing shows, the invention relates to a method 
for determining or isolating an immunoreactive substance. 
"Immunoreactive substance" as used herein refers to any 
material which provokes Borne form of immune response in the 
subject which produces it. This response may be based upon 
either a B cell or a T cell response. Such immunoreactive 
substances include proteins, peptides, glycoproteins, 
lipoproteins, peptide containing complexes (e.g., MHC/peptide 
complexes) , antibodies, and so forth. To determine such 
substances, a cDNA library is prepared from cells of a 
subject, using well known, standard methods. The cDNA is then 
inserted into an appropriate vector, such as a eukaryotic cell 
specific virus or a phage (i.e., a bacterial virus), to form 
a transfecting/transforming library, which is then 
incorporat d into a host cell. The host cells are treated so 
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that they express the library component (cloned cDNA) they 
receive. Th£ host cells are then lysed, so that the expressed 
material is available for further tre£tment. 

The lysed material is then contacted with a "stripped" 
5 sample believed to contain an immunogenic binding, partner for 

the immunoreactive substance- "Immunogenic binding partner" 
as used herein refers to any immune system associated material 
which binds to the target, i.e., the immunoreactive substance. 
Such binding partners include, but are not limited to, 

10 antibodies, T cells, cytokines, ligands, receptors, and so 

forth, as well as truncated portions of these molecules, 
complementary nucleic acid molecules, and so forth. Note that 
for some of these components, such as T cells, further steps 
including those recited herein are required. 

15 The stripped sample, as indicated supra , has been treated 

by contact with both (i) non-transf ected or transformed host 
cells, and (ii) host cells transf ected or transformed with 
vectors which do not contain the pertinent cDNA. 

The stripped sample is useful for identifying binding 

20 partners for the expressed material because many of the immune 

components which would otherwise interfere with the specific 
immunological reaction desired have been removed via the 
absorption steps described herein. 

The identification of the expressed material may be 

25 followed by isolation of the cDNA encoding it. One can punch 

holes through a membrane such as a nitrocellulose membrane, 
placed on top of Petri dishes containing colonies of host 
cells, then use the immune reaction to give position on the 
solid phase. Each colony is based upon limited cDNA 

30 transf ection, thereby facilitating isolation and 

identification of relevant cDNA. 

The invention also relates to the isolated nucleic acid 
molecules of SEQ ID NO: 1, 2 or 3 , which encode for molecules 
which are associated with particular conditions. In addition 

35 to their role as coding materials, these molecules can also be 

used as probes to identify cells expressing the relevant 
antigens, as it has been shown that these cDNA molecules (SEQ 
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ID NO: 1, 2 and 3) are based upon mRNA which translat d to the 
antigen. 

Also a part of the invention are isolated nucleic acid 
molecules, the complementary sequences of which hybridize to 
one of SEQ ID NO: 1, 2 or 3, and which encode a protein 
equivalent to those encoded by SEQ ID NO: 1, 2 or 3. 
"Stringent conditions" as used herein, refers to conditions at 
least as stringent as hybridization at 50 fxl/cm 2 of 3.5xSSC, 
lxDenhardt's solution, 25 mM sodium phosphate buffer (pH 7.0), 
using a 32 P-labelled probe, for 18 hours at 65°C, followed by 
four washes (one hour, each wash, at 65°C, 2xSSC, 0.1% SDS) , 
and a final wash for 3 0 minutes at l.OxSSC 0.2% SDS. The 
final wash can be changed to O.SxSSC to 0.2xSSC, or even 
O.lxSSC, and SDS can be lowered to 0.1% to increase 
stringency, if desired. 

The invention also includes those peptides associated 
with tumor antigens, such as those of SEQ ID NOS: 7, 8 and 9, 
which bind to HLA-A2.1 molecules, thereby provoking lysis by 
cytolytic T cells. Also a part of the invention are peptides 
of formula 

Xaa Leu Xaa 7 (SEQ ID NO: 12) 
wherein the sixth amino acid residue is Ser, Lys or Phe, and 
the ninth amino acid residue is Val or He. These molecules 
can also serve, very simply, as markers for HLA-A2.1 cells, as 
it is well known that peptide/MCH complex formation is quite 
specific. 

Other features of the invention will be clear to the 
skilled artisan and need not be reiterated here. 

The terms and expressions which have been employed are 
used as terms of description and not of limitation, and there 
is no intention in the use of such terms and expressions of 
excluding any equivalents of the features shown and described 
or portions thereof, it being recognized that various 
modifications are possible within the scope of the invention. 
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(2) INFORMATION FOR SEQUENCE ID NO: 1: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double-stranded 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CGC6AAGATG CCCCGGCGCA GCCTGCACGC GGCGGCCGTG CTCCTGCTGG 50 

TGATCTTAAA GGAACAGCCT TCCAGCCCGG CCCCAGTGAA CGGTTCCAAG 100 

TGGACTTATT TTGGTCCTGA TGGGGAGAAT AGCTGGTCCA AGAAGTACCC 150 

GTCGTGTGGG GGCCTGCTGC AGTCCCCCAT AGACCTGCAC AGTGACATCC 200 

TCCAGTATGA CGCCAGCCTC ACGCCCCTCG AGTTCCAAGG CTACAATCTG 250 

TCTGCCAACA AGCAGTTTCT CCTGACCAAC AATGGCCATT CAGTGAAGCT 300 

GAACCTGCCC TCGGACATGC ACATCCAGGG CCTCCAGTCT CGCTACAGTG 350 

CCACGCAGCT GCACCTGCAC TGGGGGAACC CGAATGACCC GCACGGCTCT 400 

GAGCATACCG TCAGCGGACA GCACTTCTCC GCCGAGCTGC ACATTGTCCA 450 

TTATAACTCA GACCTTTATC CTGACGACAG NACTGCCAGC AACAAGTCAG 500 

AAGACCTCGC TGTCCTGGGT GCTCTCATTG AGATGGGCTC CTTCAATCCG 550 

TCCTATGACA AGATCTTCAG TCACCTTCAA CATGTAAAGT ACAAAGGCCA 600 

GGAAGCATTC GTCCCGGGAT TCAACATTGA AGAGCTGCTT CCG6AGAGGA 650 

CCGCTGAATA TTACCGCTAC CGGGGGTCCC TGATCACACC CCCTTGCAAC 700 

CCCACTGTGC TCTGGACAGT TTTCCGAAAC CCCGTGCAAA TTTCCCAGGA 750 

GCAGCTGCTG GCTTTGGAGA CAGCCCTGTA CTGCACACAC ATGGACGACC 800 

CTTCCCCCAG AGAAATGATC AACAACTTCC GGCAGGTCCA GAAGTTCGAT 850 

GAGAGGCT6G TATACACCTC CTTCTCCCAA GTGCAAGTCT GTACTGCG6C 900 

AGGACTGAGT CTGGGCATCA TCCTCTCACT GGCCCTGGCT GGCATTCTTG 950 

GCATCTGTAT TGTGGTGGT6 GTGTCCATTT GGCTTTTCAG AAGGAAGAGT 1000 

ATCAAAAAAG GTGATAACAA GGGAGTCATT TACAAGCCAG CCACCAAGAT 1050 

GGAGACTGAG GCCCACGCTT GAGGTCCCCG GAGCTCCCGG GCACATCCAG 1100 

GAA6GACCTT GCTTTGGACC CTACACACTT CGGCTCTCTG GACACTTGCG 1150 

ACACCTCAAG GTGTTCTCTG TAGCTCAATC TGCAAACATG CCAGGCCTCA 1200 

GGGATCCTCT GCTGGGTGCC TCCTTGTCTT GGGACCATGG NCACCCCAGA 1250 

GCCATCCGAT CGATGGATG6 GATGCACTCT CAGACCAAGC AGCAGGAATT 1300 

CAAAGCT6CT TGCTGTAATT GTGTGAGATT GTGAAGTGGT CTGAATTCTG 1350 

GAATCACAAA CCAACCATGC TGGTGGGCCA TTAAT6GTT6 GAAAACACTT 1400 

CCATCCGGGG CTTTGCCAGA GCGT6CTTTC AAGTGTCCTG GAAATTCTGC 1450 

TGCTTCTCCA AGCTTTCAGA CAAGAATGTG CACTCTCTGC TTAGGTTTTG 1500 

CTTGGGAAAC TCAACTTCTT TCCTCTGGAG ACGGGACATC TCCCTCTGAT 1550 

TTCCTTCTGC TATGCAAAAC CTTTAATCTG CACCTTACAN ACTCGGGGAC 1600 

AAATGGGGAC AGGAAGGATC AAGTTGTAGA GNAGAAAAAG AAAACAAGAG 1650 

ATATACATTG TGATATATAT TAGGGACACT TTCACAGTCC TGTCCTCTGG 1700 

ATCACAGACA CTGCACAGAC CTTAGGGAAA TGGCAGGTTC AAAGTTCCAC 1750 

TTCTTGGTGG GGATGAGAAG GGAGAGAGAG CTAGAGGGAC AAAGAGAATG 1800 

AGAAGACATG GAT6ATCTGG GAGAGTCTCA CTTCGGAATC AGAATTGGAA 1850 

TCACATTCTG TTTATCAAGC CATAATGTAA GGACAGAATA ATACAATAAT 1900 

AAGTCCAAAT CCAACCTCCT GTCAGTGGAA CAGTTATGTT TTATACTCTA 1950 

CAGATTTTAC AAATANATGA GGCTNGTTCC TTGAAAANTG TGTTGNNTTG 2000 

CTGTNGTCCN NTGGAGGAGA CATGAGTTCC GAGATGACCA ACTCNNGCNT 2050 

TGNATNCTNG GAGGNAATAN GGCAGAACCA AAATGACTGT AGAACTTATT 2100 

CTCTGTAGGC CAAATTTCAT TTCAGCCACT TCTGCAGGAT CCTACTGCCA 2150 

ACCTG6AATG GAGACTTTTA TCTACTTCTC TCTCTCTGAA GATGTCAAAT 2200 

CGTGGTTTAG ATCAAATATA TTTCAAGCTA TAAAAGCAGG AGGTTATCTG 2250 

TGCAGGGGGC TGGCATCATG TATTTAGGGG CAAGTAATAA TGGAATGCTA 2300 

CTAAGATACT CCATATTCTT CCCCGAATCA CACAGACAGT TTCTGACAGG 2350 
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CGCAACTCCT CCATTTTCCT CCCGCAGGTG 
CAGTGCCATG ACTGAGAAGG AACCGACCCC 
TTCCCCGAGA ACTTTCTGAT TGCACAGTCT 
TCCTCTTGAA GCATAGCTTT TTAAATATCT 
CTGACTCTAG GAATTCTCTC TTCTGGAATC 
GGTTGCAGTA AGCCAAGG7C ATGCCACTGC 
GCGAGACTCC ATCTCAAAAA AAAAAAAAA 



AGAACCCTGT GGAGATGAGT 2400 

TAGTTGAGAG CACCTTGCAG 2450 

CATTTTGACA GCATGAAATG 2500 

TTTTCCTTCT ACTCCTCCCT 2550 

GCTTGAACCC AGGAGGCGGA 2600 

ACTCTAGCCT GGGTGACAGA ^ 2650 

2679 



(2) INFORMATION FOR SEQUENCE ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: double- stranded 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ACTTTCTCTC TCTTTCGATT CTTCCATACT CAGAGTACGC ACGGTCTGAT 50 

TTTCTCTTTG GATTCTTCCA AAATCAGAGT CAGACTGCTC CC6GTGCCAT 100 

GAACGGAGAC GACGCCTTTG CAAGGAGACC CACGGTTGGT GCTCAAATAC 150 

CAGAGAAGAT CCAAAAGGCC TTCGATGATA TTGCCAAATA CTTCTCTAAG 200 

GAAGAGTGGG AAAAGATGAA AGCCTCGGAG AAAATCTTCT ATGTGTATAT 250 

GAAGAGAAAG TATGAGGCTA TGACTAAACT AGGTTTCAAG GCCACCCTCC 300 

CACCTTTCAT GTGTAATAAA CGGGCCGAAG ACTTCCAGGG GAATGATTTG 350 

GATAATGACC CTAACCGTGG GAATCAGGTT GAACGTCCTC AGATGACTTT 400 

CGGCAGGCTC CAGGGAATCT CCCCGAAGAT CATGCCCAAG AAGCCAGCAG 450 

AGGAAGGAAA TGATTCGGAG GAAGTGCCAG AAGCATCTGG CCCACAAAAT 500 

GATGGGAAAG AGCTGTGCCC CCCGGGAAAA CCAACTACCT CTGAGAAGAT 550 

TCACGAGAGA TCTGGACCCA AAAGGGGGGA ACATGCCTGG ACCCACAGAC 600 

TGCGTGAGAG AAAACAGCTG GTGATTTATG AAGAGATCAG CGACCCTGAG 650 

GAAGATGACG AGTAACTCCC CTCAGGGATA CGACACATGC CCATGATGAG 700 

AAGCAGAACG TGGT6ACCTT TCACGAACAT GGGCATGGCT GCGGACCCCT 750 

CGTCATCAGG TGCATAGCAA GTGAAAGCAA GTGTTCACAA CAGTGAAAAG 800 

TTGAGCGTCA TTTTTCTTAG TGTGCCAAGA GTTCGATGTT AGCGTTTACG 850 

TTGTATTTTC TTACACTGTG TCATTCTGTT AGATACTAAC ATTTCATTGA 900 

TGACGAAGAC ATACTTAATC GATATTTGGT T 931 



(2) INFORMATION FOR SEQUENCE ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double- stranded 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GATCCCCCGG GCTGCAGGAA TTCGGCACGA GCAAAGGACT TCCTAGTGGG 50 

TGTGAAAGGC AGC6GTGGCC ACAGAGGCGG CGGAGAGATG GCCTTCAGCG 100 

GTTCCCAGGC TCCCTACCTG AGTCCAGCTG TCCCCTTTTC TGGGACTATT 150 

CAAGGAGGTC TCCAGGACGG ACTTCAGATC ACTGTCAATG GGACCGTTCT 200 

CAGCTCCAGT GGAACCAGGT TTGCTGTGAA CTTTCAGACT GGCTTCAGTG 250 

GAAATGACAT TGCCTTCCAC TTCAACCCTC GGTTTGAAGA TGGA6GGTAC 300 

TTGGTGTCCA ACACGAGGCA GAACGGAAGC TGGGGGCCCG AGGAGAGGAA 350 
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GACACACATG CCTTNCCAGA AGGGGATGCC CTTTGACCTC TGCTTCCTGG 400 

TGCAGAGCTC AGATTTCAAG GTGATGGTGA ACGGGATCCT CTTCGTGCAG 450 

TACTTCACAT CTCGTCATGC CCTGTCCACC GTTGTGGACA CCATCTCCGT 500 

CAATGGCTCT GTGCAGCTGT CCTACATCAG CTTCCAGCCT CCCGGCGTGT 550 

GGCCTGCCAA CCCGGCTCCC ATTACCCAGA CAGNNNTCAT CCACACAGTN 600 
GCAGAGCGCC CNCTGGACAG ATGTCTCTAC TCCCGCCATC CCACCTATGA , 650 

TGTACCCCCA CCCCGCCTAT CCGATGCCTT TCATCACCAC CATTCTGGGA 700 

GGGCTGTACC CATCCAAGTC CATCCTCCTG TCAGGCACTG TNCTGCCCAG 750 

TGCTCANGAG GTTCCACATC NAACCTGTGC NCTGGGAACC ACATCGCCTT 800 

CCACCTGAAC CCCCGTTTTG ATGAGAATGC TGTGGTCCGC AACACCCAGA 850 

TCGACAACTC CTGGGGGTCT CAGGAGCGAA GTCTGCCCCG AAAAATGCCC 900 

TTCGTCCGTG GCCAGAGCTT CTCAGTGTGG ATCTTGTGTG AAGCTCACTG 950 

CCTCAAGGTG GCCGTGGATG GTCA6CACCT GTTTGAATAC AACCATCGCC 1000 

TGAGGAACCT GCCCACCATC AACAGACTGG AAGTGGGGGG CGACATCCAG 1050 

CTGACCATGT GCAGACATAG GCGGCTTCCT GGCCCTGGGG CCGGGGGCTG 1100 

GGGTGTGGGG CAGTCTGGGT CCTCTCATCA TCCCCACTTC CCAGGCCCAG 1150 

CCTTTCCAAC CCTGCCTGGG ATCTGGGCTT TAATGCAGAG GCCATGTCCT 1200 

TGTCTGGTCC TGCTTCTGGC TACAGCCACC CTGGAACGGA GAAGGCAGCT 1250 

GACGGGGATT GCCTCCTCAG CCGCAGCAGC ACCTGGGGCT CCAGCTGCTG 1300 

GAATCCTACC ATCCCAGGAY GCAGGCACAG CCAGGGAGAG GGGAGGNGTG 1350 

GGCAGTGAAG ATGAAGCCCC ATGCTCAGTC CCCTCCCATC CCCCACGCAG 1400 

CTCCACCCCA GTCCCAAGCC ACCAGCTGTC TGCTCCTGGT GGGAGGTGGC 1450 

CTCCTCAGCM CCTCCTCTCT GACCTTTAAC CTNACTCTCA GCTTGCACCG 1500 

TGCACCAACC CTTCACCCCT CCTGGAAAGC AGGCCT6ATG GCTTCCCACT 1550 

GGCCTCCACC ACCT6ACCAG AGTGTTCTCT TCAGAGGACT GGCTCCTTTC 1600 

CCAGTGTCCT TAAAATAAAG AAATGAAAAT NCTTGTTGGC AAAAAAAAAA 1650 
AAAAAAAAAC TCGAGGGGCN NCCCNGTACC CAATTCGCCC TA 1692 



(2) INFORMATION FOR SEQUENCE ID NO: 4: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double- stranded 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



ATCTGCAGAA TTC6GCTTCG ATCTAGAACT AGTGGATCCC CCGGGCTGCA 50 

GGAATTCGGC ACGAGCGGTT CCAAGTGGAC TTATTTTGGT CCTGATGGGG 100 

AGAATAGCTG GTCCAAGAAG TACCCGTCGT GTGGGGGCCT GCTGCAGTCC 150 

CCCATAGACC TGCACAGTGA CATCCTCCAG TATGACGCCA GCCTCACGCC 200 

CCTCGAGTTC CAAGGCTACA ATCTGTCTGC CAACAAGCAG TTTCTCCTGA 250 

CCAACAATGG CCATTCAGTG AAGCTGAACC TGCCCTCGGA CATGCACATC 300 

CAGGGCCTCC AGTCTCGCTA CAGTGCCACG CAGCTGCACC TGCACTGGGG 350 

GAACCCGAAT GACCCGCACG GCTCTGAGCA TACCGTCAGC GGACAGCACT 400 

TCTCCGCCGA GCTGCACATT GTCCATTATA ACTCAGACCT TTATCCTGAC 450 

GACAGNACTG CCAGCAACAA GTCAGAAGAC CTCGCTGTCC TGGGTGCTCT 500 

CATTGAGATG GGCTCCTTCA ATCCGTCCTA TGACAA6ATC TTCAGTCACC 550 

TTCAACATGT AAAGTACAAA GGCCAGGAAG CATTCGTCCC GGGATTCAAC 600 

ATTGAAGAGC TGCTTCCGGA GAGGACCGCT GAATATTACC GCTACCGGGG 650 

GTCCCTGATC ACACCCCCTT GCAACCCCAC TGTGCTCTGG ACAGTTTTCC 700 

GAAACCCCGT GCAAATTTCC CAG6AGCAGC TGCTGGCTTT GGAGACAGCC 750 

CTGTACTGCA CACACATGGA CGACCCTTCC CCCAGAGAAA TGATCAACAA 800 

CTNCCGGCAG GTCCAGAAGT TCGNTGAGAG GCTGGTATAC ACCTCCTTCT 850 
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CNCAAGTGCA A6TCTGTACT GCGGCAGGAC 
TCACTGGCCC TGGCTGGCAT TCTTGGCATC 
CATTTGGCTT TTCAGAAGGA AGAGTANCCC 
GTCATTTACA AGCCANCCAC CAAGATGGAG 
TCCCCGGAGC TCCCGGGCAC ATCCAGGAAG 
CACACTTCGG CTCTCTGGAC ACTTGCGACA 
CTCAATCTGC AAACATGCCA GGCCTCAGGG 
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TGAGTCTGGG CAtCATCCTC 900 

TGTATTGTGG TGGTGGTGTC 1000 

CNAAAGGTGA TAACAAGGGA 1050 

ACTGAGGCCC ACGCTTGAGG 1100 

GACCTTGCTT TTGGACCCTA 1150 
CCTCAAGGTG TTCTCTGTAG ~ 1200 

ATCCTCTGCT 1240 



(2) INFORMATION FOR SEQUENCE ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCCAAATACT TCTCTAAGGA AGAGTG6 27 



(2) INFORMATION FOR SEQUENCE ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TTCACTGTTG TGAACACTTG CTTTCAC 27 



(2) INFORMATION FOR SEQUENCE ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) T0P0L06Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
5 
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(2) INFORMATION FOR SEQUENCE ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: B: 

Arg Leu Gin Gly lie Ser Pro Lys He 
5 



(2) INFORMATION FOR SEQUENCE ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Arg Leu Arg Glu Arg Lys Gin Leu Val 
5 



(2) INFORMATION FOR SEQUENCE ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lys He Gin Lys Ala Phe Asp Asp He 
5 



(2) INFORMATION FOR SEQUENCE ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Cys Leu Gly Gly Leu Leu Thr Met Val 
5 



(2) INFORMATION FOR SEQUENCE ID NO: 12: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

He Leu Lys Glu Pro Val Gly Val 
5 



SUBSTITUTE SHEET (RULE 26) 



WO 96/40209 



PCT/US96/09726 



33 



(2) INFORMATION FOR SEQUENCE ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ix) FEATURE: The sixth amino acid is Ser, Ly$ or Phe. 
ninth 

amino acid is Val or He 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
5 
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I claim 

1. Isolated nonapeptide which binds to an HLA-A2.1 
molecule, and has fomrula 

Xaa Leu Xaa7 

wherein the sixth amino acid residue is Ser r Lys or Phe, and the 
ninth amino acid residue is Val or lie. 

2* The isolated nonapeptide of which 1, selected from the 
group consisting of: 

SEQ ID NO: 7, 
SEQ ID NO: 8, and 
SEQ ID NO: 9- 

3 . The isolated nonapeptide of claim 1 , which is SEQ ID 
NO: 7. 

4. The isolated nonapeptide of claim 1, which is SEQ ID 
NO: 8. 

5. The isolated nonapeptide of claim 1, which is SEQ ID 
NO: 9. 

6. Isolated protein encoded by the nucleic acid molecule 
of SEQ ID NO: 1. 

7 Method for screening for possible presence of a tumor 
associated protein, comprising contacting a sample with at least 
one of SEQ ID NO: 4 and SEQ ID NO: 5, and determining 
hybridization of SEQ ID NO: 4 or SEQ ID NO: 5 to a target, as a 
determination of said tumor associated antigen in said sample. 

8. The method of claim 7, comprising contacting said 
sample with both SEQ ID NO: 4 and SEQ ID NO: 5. 
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9. The method of claim 8, comprising polymerase chain 
reaction. 

10. Method for determining an immunoreactive substance 
produced by a subject, comprising: 

(a) producing a cDNA library of a cell taken from said 
subject, 

(b) inserting said cDNA library into a vector, 

(c) transfecting said vector into a host cell to produce 
a transfected host cell, 

(d) culturing said transfected host cell to express said 
immunoreactive substance, 

(e) lysing said transfected host cell to form a lysate, 

(f ) contacting a sample containing an immunogenic binding 
partner for said immunoreactive substance with a sample of 
lysed, non-transfected host cells, to remove any immunogenic 
binding partner from said sample which is specific for said non- 
transfected host cell, to produce a stripped sample, 

(g) contacting said stripped sample to a sample of lysed 
host cells transfected with the same vector into which said cDNA 
has been inserted wherein said vector does not contain any cDNA, 
to remove any immunoreactive binding partners specific for said 
vector, thereby producing a twice stripped sample, 

(h) contacting said twice stripped sample to the lysate of 
(f), whereby any immunoreactive binding partners specific for 
said immunoreactive binding partner bind thereto, and 

(i) determining binding in (h) to determine said 
immunoreactive substance . 

11. The method of claim 10, further comprising identifying 
the transfected host cell which expressed said immunoreactive 
substance, and isolating the cDNA contained therein. 

12. The method of claim 10, further comprising removing 
any immunoreactive binding partner to isolate a specific 
immunoreactive binding partner for said substance. 

13. The method of claim 10, wherein said host cell is E. 

coli . 
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14. The method of claim 10, wherein said vector is a viral 
vector . 

15. The method of claim 14 , wherein said viral vector is 
a eukaryotic virus based vector. 

16. The method of claim 14, wherein said viral vecto r is 
a phage vector. 

17. The method of claim 16, wherein said phage is lambda 
phase. 

18. The method of claim 10, wherein said host cell is a 
prokaryote . 

19. The method of claim 10, wherein said host cell is a 
eukaryote . 

20. The method of claim 10, further comprising 
immobilizing said lysate to a solid phase prior to contact with 
said twice stripped sample. 

21. The method of claim 10, wherein said sample is serum. 

22. The method of claim 21, wherein said serum is 
autologous serum. 

23. The method of claim 10, wherein said immunogenic 
binding partner is an antibody. 

24. The method of claim 10, wherein said immunoreactive 
substance is an antigen associated with cancer. 

25. The method of claim 10, wherein said immunoreactive 
substance is an autoimmune associated antigen* 

26. The method of claim 10, wherein said immunoreactive 
substance is produced by a pathogen. 

27* Isolated nucleic acid molecule which codes for a renal 
cell carcinoma-specific antigen, the complementary sequence of 
which hybridizes to SEQ ID NO: l, under stringent conditions. 

28. Isolated nucleic acid molecule which codes for a 
Hodgkin's Disease-specific antigen, the complementary sequence 
of which hybridizes to SEQ ID NO: 2 under stringent conditions. 

29. Isolated nucleic acid molecule which codes for a 
melanoma specific antigen, the complementary sequence of which 
hybridizes to SEQ ID NO: 3 under stringent conditions. 
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30. Isolated nucleic acid molecule of claim 27, consisting 
of SEQ ID NO: 1. 

31. Isolated nucleic acid molecule of claim 28, consisting 
Of SEQ ID NO: 2. 

32. Isolated nucleic acid molecule of claim 29, consisting 
of SEQ ID NO: 3. 

33. Expression vector comprising the isolated nucleic acid 
molecule of claim 27, operably linked to a promoter. 

34. Expression vector comprising the isolated nucleic acid 
molecule of claim 28, operably linked to a promoter. 

35. Expression vector comprising the isolated nucleic acid 
molecule of claim 29, operably linked to a promoter. 

36. Cell line transformed or transfected with the isolated 
nucleic acid molecule of claim 27. 

37. Cell line transformed or transfected with the isolated 
nucleic acid molecule of claim 28. 

38. Cell line transformed or transfected with the isolated 
nucleic acid molecule of claim 29. 

39. Cell line transformed or transfected with the 
expression vector of claim 33. 

40. Cell line transformed or transfected with the 
expression vector of claim 34 . 

41. Cell line transformed or transfected with the 
expression vector of claim 35. 
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